Abstract

We investigate the possibility that the W+4 jets background is a source of the anomalously large top-antitop ($t\bar{t}$) charge asymmetry observed at the Tevatron. We simulate the $t\bar{t}$ reconstruction of the signal and background events at the matrix-element level and find that the reconstructed $t\bar{t}$ candidates from the W+4 jets background could give a large forward-backward asymmetry. We suggest a serious re-evaluation of the W+4 jets background for the $t\bar{t}$ candidate events by studying the distributions of the reconstructed $t\bar{t}$ systems in their rapidity difference, the transverse momentum of the top quark, and that of the $t\bar{t}$ system, which can differ significantly from those of the QCD $t\bar{t}$ production events, especially at high $t\bar{t}$ invariant mass.

1. Introduction

Since the CDF [1] and D0 [2] experiments reported a significant deviation from the next-to-leading order (NLO) prediction [3, 4, 5, 6] in the forward-backward asymmetry of top-antitop ($t\bar{t}$) production in $p\bar{p}$ collisions, the origin of the discrepancy has been one of the serious issues in high-energy physics. Contributions from the NNLL QCD [7] and the QED higher-order corrections [8] have been discussed, but they are not sufficient to explain the anomaly. Although both experiments seem to be confident that the large asymmetry originates in the $t\bar{t}$ signal events, there may be room for the background events to be responsible for the large positive asymmetry, in view of the complexity of the experimental analysis and our imperfect understanding of multi-jet events at hadron colliders.

In this short report we investigate the possibility that the W+4 jets background, the main background for the $t\bar{t}$ production process, is a source of the disagreement in the top forward-backward asymmetry. We simulate the experimental event-selection cuts for the $t\bar{t}$-candidate events using event samples generated with exact leading-order matrix elements. We find that the reconstructed $t\bar{t}$ candidates from the W+4 jets background can give rise to a large forward-backward asymmetry, especially at high $t\bar{t}$ invariant mass. We present kinematic distributions at the parton level because the smearing due to subsequent parton showering is not fully understood for multi-jet processes at hadron colliders, as hinted by the discrepancy in the di-jet invariant mass distribution in W+2 jets events studied at the Tevatron [9, 10]. The following results should hence be regarded as hints of what could be observed in real experiments after smearing due to QCD shower development and hadronization, detector resolution, and the complexities of event reconstruction.

2. Event selection and reconstruction 

In this section we summarize the parton-level event generation and the $t\bar{t}$-reconstruction procedures used to obtain the $t\bar{t}$ signal and its W+4 jets background distributions. We use MadGraph [11] and its GPU version [12, 13, 14] to generate the following three sets of events:



$$p\bar{p} \to t\bar{t} \to (\ell^{+}\nu_{\ell}\, b)\,(\bar{b}\,2j), \tag{1}$$

$$p\bar{p} \to W + b\bar{b}\,2j \to (\ell^{+}\nu_{\ell})\, b\bar{b}\,2j, \tag{2}$$

$$p\bar{p} \to W + 4j \to (\ell^{+}\nu_{\ell})\, 4j, \tag{3}$$

where $\ell$ is an electron or muon, while $j$ can be a gluon or a light ($u$, $d$, $s$, $c$) quark or antiquark. Sample (1) gives the $t\bar{t}$ signal events, while (2) and (3) contribute to the W+4 jets background. We use the CTEQ6L1 parton distribution functions and set the renormalization and factorization scales at half of the $t\bar{t}$ invariant mass ($M_{t\bar{t}}$) for the $t\bar{t}$ events (1) and at 20 GeV for the W+4 jets events (2) and (3). Although these are parton-level samples, we regard each generated gluon or quark as a jet. The generated $t\bar{t}$ sample does not have NLO contributions and gives no forward-backward asymmetry. However, the asymmetry predicted by the Standard Model (SM) at NLO is less than the experimental resolutions of the CDF and D0 experiments [1, 2], and hence we ignore QCD higher-order corrections in this study.



Generated samples are then passed to the event selection procedure. First, they are required to satisfy the following kinematic cuts, which simulate the typical $t\bar{t}$ event-selection cuts at the Tevatron [2]:

$$p_{Tj} > 20\ \mathrm{GeV}, \quad |\eta_j| < 2.5, \tag{4a}$$

$$\max\{p_{Tj}\} > 40\ \mathrm{GeV}, \tag{4b}$$

$$p_{T\ell} > 20\ \mathrm{GeV}, \quad |\eta_\ell| < 2.0, \tag{4c}$$

$$\Delta R_{jj} > 0.5, \quad \Delta R_{j\ell} > 0.5, \quad \Delta R_{\ell\ell} > 0.5, \tag{4d}$$

$$\not{p}_T > 20\ \mathrm{GeV}, \tag{4e}$$

where $p_T$ and $\eta$ are the transverse momentum and pseudorapidity, respectively; $j$ can be a gluon, quark or antiquark; and $\ell$ represents an electron or muon. Here $\Delta R_{ab}$ is the distance in the (pseudorapidity)-(azimuthal-angle) plane,

$$\Delta R_{ab} = \sqrt{(\eta_a - \eta_b)^2 + (\phi_a - \phi_b)^2}, \tag{5}$$

and $\not{p}_T$ is defined as the magnitude of the transverse momentum of the neutrino,

$$\not{p}_T = |\vec{p}_{T\nu_\ell}|. \tag{6}$$
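The cuts of eqs. (4) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Particle` container and the function names are hypothetical, every generated parton is treated as a jet that must pass (4a), the azimuthal difference is wrapped into [-π, π) (eq. (5) does not spell this out), and the $\Delta R_{\ell\ell}$ condition is omitted because each event contains a single charged lepton.

```python
import math
from dataclasses import dataclass


@dataclass
class Particle:
    pt: float   # transverse momentum in GeV
    eta: float  # pseudorapidity
    phi: float  # azimuthal angle in radians


def delta_r(a, b):
    """Distance of eq. (5), with the azimuthal difference wrapped (assumption)."""
    dphi = (a.phi - b.phi + math.pi) % (2.0 * math.pi) - math.pi
    return math.hypot(a.eta - b.eta, dphi)


def passes_kinematic_cuts(jets, lepton, missing_pt):
    """Return True if a parton-level event satisfies eqs. (4a)-(4e)."""
    if not jets:
        return False
    # (4a): every jet has pT > 20 GeV and |eta| < 2.5
    if any(j.pt <= 20.0 or abs(j.eta) >= 2.5 for j in jets):
        return False
    # (4b): the hardest jet has pT > 40 GeV
    if max(j.pt for j in jets) <= 40.0:
        return False
    # (4c): lepton pT > 20 GeV and |eta| < 2.0
    if lepton.pt <= 20.0 or abs(lepton.eta) >= 2.0:
        return False
    # (4d): jet-jet and jet-lepton separation above 0.5
    for i, a in enumerate(jets):
        if delta_r(a, lepton) <= 0.5:
            return False
        for b in jets[i + 1:]:
            if delta_r(a, b) <= 0.5:
                return False
    # (4e): missing transverse momentum above 20 GeV
    return missing_pt > 20.0
```

At parton level `missing_pt` is simply the neutrino transverse momentum of eq. (6).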

We then select $t\bar{t}$ candidate events with a lepton + 4 jets + large $\not{p}_T$ by requiring the following three conditions:

one electron or muon, (7a)

at least 4 jets, (7b)

one or two $b$-tagged jets. (7c)

When applying the requirement (7c), we assume a 70% $b$-tagging efficiency and an 8% probability of misidentifying a light jet as a $b$-jet [2]. Therefore, 84.6% of the signal (1) and $\ell\nu b\bar{b}2j$ background events (2) survive the condition (7c), while 31.4% of the $\ell\nu 4j$ background events (3) remain. It turns out that the $\ell\nu 4j$ events with one or two mistags give the dominant background to the $t\bar{t}$ signal events.
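The survival fraction under condition (7c) can be cross-checked with simple counting. The sketch below assumes that each of the four jets is tagged independently, with the 70% efficiency for $b$ jets and a flat 8% mistag rate for all other jets; the function name and this independence assumption are ours, not part of the analysis in [2].

```python
from itertools import product


def tag_fraction(n_b, n_light, eff_b=0.70, mistag=0.08, allowed=(1, 2)):
    """Probability that the number of tagged jets is in `allowed`,
    assuming each jet is tagged independently (an assumption)."""
    probs = [eff_b] * n_b + [mistag] * n_light
    total = 0.0
    # Sum over every tagged/untagged pattern of the jets.
    for pattern in product((0, 1), repeat=len(probs)):
        if sum(pattern) in allowed:
            p = 1.0
            for tagged, prob in zip(pattern, probs):
                p *= prob if tagged else 1.0 - prob
            total += p
    return total


signal = tag_fraction(n_b=2, n_light=2)  # samples (1) and (2)
light = tag_fraction(n_b=0, n_light=4)   # sample (3) with a flat mistag rate
print(f"two b jets + two light jets: {signal:.3f}")
print(f"four light jets:             {light:.3f}")
```

For two $b$ jets and two light jets this counting reproduces the 84.6% quoted above. For four light jets at a flat 8% it gives about 28%, so the 31.4% quoted for sample (3) presumably involves inputs that this simple sketch does not capture (for instance a flavor-dependent mistag rate, which is a guess on our part).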

We then reconstruct $t\bar{t}$ pairs by following the constrained kinematic fit [2] employed in analyses of top physics at the Tevatron. The lepton, the missing transverse momentum ($\not{p}_T$) and the leading four jets are used for the reconstruction. The neutrino $p_T$ is reconstructed as the missing $p_T$, but its rapidity is not fully determined by the $W$ mass constraint, which leaves two possible neutrino four-momenta in the zero $W$-width limit. Two of the leading four jets are assigned to the $b$ and $\bar{b}$ quarks, while the other two should form the $W$ boson. In general, there are 24 possible combinations of the two neutrino momenta and the jet assignments. By assigning a $b$-tagged jet to the $b$ or $\bar{b}$ quark, this number is reduced to twelve for single-$b$-tag events, and to four for double-$b$-tag events. Among all these possible combinations, we select one combination for the reconstructed $t\bar{t}$ candidate by minimizing the $\chi^2$ function [15] of eq. (8), where $M_W$, $M_t$, $\Gamma_W$ and $\Gamma_t$ are the masses and widths of the $W$ boson and the top quark: 80.4 GeV, 172.5 GeV, 2.08 GeV and 1.35 GeV, respectively. $M_{jj}$, $M_{\ell\nu}$, $M_{bjj}$ and $M_{b\ell\nu}$ are the invariant masses of the two jets that are assigned to the $W$ boson (Wjets), the lepton-neutrino system, the $b$-Wjets system, and the $b$-lepton-neutrino system, respectively. The top and antitop quarks are identified by the lepton charge.
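The counting of combinations (24, twelve and four) and the selection by minimum $\chi^2$ can be sketched as follows. The explicit form of the $\chi^2$ used here, a sum of four terms $(M_i - M_i^0)^2/\Gamma_i^2$ built from the masses and widths listed above, is an assumption consistent with those definitions rather than a transcription of eq. (8). The jet indexing convention, the function names and the `masses_of` callback are likewise hypothetical.

```python
from itertools import permutations

M_W, GAMMA_W = 80.4, 2.08   # GeV, values quoted in the text
M_T, GAMMA_T = 172.5, 1.35  # GeV, values quoted in the text


def assignments(n_tagged):
    """Enumerate (b_lep, b_had, w_jets, nu_solution) for the four leading jets.
    Jets are indexed 0..3 and the first `n_tagged` are taken to be b-tagged;
    every tagged jet must be used as one of the two b quarks."""
    tagged = set(range(n_tagged))
    combos = []
    for b_lep, b_had in permutations(range(4), 2):
        if not tagged <= {b_lep, b_had}:
            continue
        w_jets = tuple(j for j in range(4) if j not in (b_lep, b_had))
        for nu_solution in (0, 1):  # two neutrino solutions from the W mass constraint
            combos.append((b_lep, b_had, w_jets, nu_solution))
    return combos


def chi2(m_jj, m_lnu, m_bjj, m_blnu):
    """Assumed chi^2: squared mass residuals in units of the widths."""
    return (((m_jj - M_W) / GAMMA_W) ** 2 + ((m_lnu - M_W) / GAMMA_W) ** 2
            + ((m_bjj - M_T) / GAMMA_T) ** 2 + ((m_blnu - M_T) / GAMMA_T) ** 2)


def best_combination(combos, masses_of):
    """Pick the combination with the smallest chi^2. `masses_of` is a
    user-supplied function mapping a combination to its four invariant masses."""
    return min(combos, key=lambda c: chi2(*masses_of(c)))


for n in (0, 1, 2):
    print(n, "b-tags:", len(assignments(n)), "combinations")
```

The printed counts follow the numbers given in the text for untagged, single-tag and double-tag events.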

For each selected event, the rapidity difference ($\Delta y$) between the reconstructed top and antitop quarks is defined as

$$\Delta y = y_t - y_{\bar{t}} = q\,(y_{t,\mathrm{lep}} - y_{t,\mathrm{had}}), \tag{9}$$

where $q$ is the sign of the lepton charge, and $y_{t,\mathrm{lep}}$ and $y_{t,\mathrm{had}}$ are the rapidities of the leptonically and hadronically decaying top quarks, respectively. The forward-backward asymmetry is then defined as

$$A^{t\bar{t}}_{FB} = \frac{N(\Delta y > 0) - N(\Delta y < 0)}{N(\Delta y > 0) + N(\Delta y < 0)}, \tag{10}$$

where $N(\Delta y > 0)$ and $N(\Delta y < 0)$ are the numbers of events with positive and negative rapidity difference, respectively. This asymmetry is evaluated for the event samples (1), (2) and (3) separately.
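Equations (9) and (10) translate directly into code. The sketch below is illustrative only: the function names are hypothetical, the optional per-event weights are an addition for weighted parton-level samples, and the uncertainty uses the standard binomial estimate $\sqrt{(1 - A^2)/N}$ for unweighted events, which is our assumption about how error bars might be assigned rather than a description of the procedure used for Fig. 1.

```python
import math


def delta_y(lepton_charge, y_top_lep, y_top_had):
    """Eq. (9): rapidity difference signed by the lepton charge."""
    q = 1 if lepton_charge > 0 else -1
    return q * (y_top_lep - y_top_had)


def forward_backward_asymmetry(delta_ys, weights=None):
    """Eq. (10), optionally with per-event weights (an addition)."""
    if weights is None:
        weights = [1.0] * len(delta_ys)
    n_fwd = sum(w for dy, w in zip(delta_ys, weights) if dy > 0)
    n_bwd = sum(w for dy, w in zip(delta_ys, weights) if dy < 0)
    total = n_fwd + n_bwd
    if total == 0:
        raise ValueError("no events with non-zero rapidity difference")
    return (n_fwd - n_bwd) / total


def binomial_error(asymmetry, n_events):
    """Standard binomial uncertainty for unweighted events (assumption)."""
    return math.sqrt((1.0 - asymmetry ** 2) / n_events)


# Toy example: three forward events and one backward event.
sample = [0.5, 1.2, -0.3, 0.1]
a_fb = forward_backward_asymmetry(sample)
print(a_fb, binomial_error(a_fb, len(sample)))
```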

3. Results 

In this section we discuss kinematic distributions of the reconstructed $t\bar{t}$ candidates in the $t\bar{t}$ sample (1) and the W+4 jets background samples (2) and (3) described above. Fig. 1 shows the $M_{t\bar{t}}$ distribution (upper) and the $M_{t\bar{t}}$ dependence of $A^{t\bar{t}}_{FB}$ (lower). In the upper plot, the solid red, dashed blue and dotted green histograms show the differential cross sections in units of fb/GeV after event selection for the $t\bar{t}$ signal (1), W+4j (3) and W+$b\bar{b}$+2j (2) background samples, respectively. Because of the large cross section of the W+4 light-jets process that passes the first selection cuts of eqs. (4), 5.3 pb with the fixed QCD coupling of $\alpha_s = 0.171$, the sample (3) gives the major background contribution of 1.4 pb even after the $b$-tagging requirement (7c). The W+$b\bar{b}$+2j sample (2) gives 0.10 pb, while the leading-order $t\bar{t}$ signal (1) gives 1.8 pb. All the distributions peak at $M_{t\bar{t}} \sim 400$ GeV and decrease exponentially in the high $M_{t\bar{t}}$ region. The cross section of the W+4j background decreases more slowly than those of the other two samples. In the lower plot, the forward-backward asymmetries of the $t\bar{t}$, W+4j and W+$b\bar{b}$+2j samples are shown separately as the red circles, blue triangles and green rectangles, respectively, with their error bars, for the $M_{t\bar{t}}$ bins of 350-450, 450-550, 550-650 and 650-950 GeV. As expected, the asymmetry of the $t\bar{t}$ sample is consistent with zero in all bins, since the sample is generated at tree level. The asymmetries of the W+4j and W+$b\bar{b}$+2j samples have finite values and increase monotonically with $M_{t\bar{t}}$; the W+4j sample gives the largest asymmetry in all bins, with values up to 0.15, while the W+$b\bar{b}$+2j sample gives an asymmetry of 0.02 to 0.04 as $M_{t\bar{t}}$ grows from the 400 GeV bin to the 800 GeV bin.

Figure 1: Upper: $M_{t\bar{t}}$ distributions of the reconstructed $t\bar{t}$ candidates in units of fb/GeV. The solid red, dashed blue and dotted green histograms show the differential cross sections after event selection in the $t\bar{t}$, W+4j and W+$b\bar{b}$+2j samples, respectively. Lower: the $M_{t\bar{t}}$ dependence of the top forward-backward asymmetries of the reconstructed $t\bar{t}$ candidates. Asymmetries are evaluated in each bin; the last three bins are merged into one.

The positive asymmetry in the W+4 jets events reflects the tendency of the $W^+$ to be emitted along the proton direction. We show in Fig. 2 the $\Delta y$ distributions of the reconstructed $t\bar{t}$ candidates in the $t\bar{t}$ and W+4 jets samples. Here and below, the "W+4 jets" sample is the sum of the W+$b\bar{b}$+2j (2) and W+4j (3) samples. The solid red histogram is for the $t\bar{t}$ signal sample, which is symmetric about $\Delta y = 0$, as expected for the tree-level QCD prediction. The dashed blue histogram gives the distribution of the total W+4 jets background sample. Furthermore, the W+4 jets sample is divided into three sub-samples according to the jet component of the reconstructed leptonic-top candidates: $\ell^+$-$\nu$-gluon ($\ell^+\nu g$), $\ell^+$-$\nu$-quark ($\ell^+\nu q$) and $\ell^+$-$\nu$-antiquark ($\ell^+\nu\bar{q}$). The solid green, dash-dotted blue and dotted orange histograms show the contributions of the $\ell^+\nu g$, $\ell^+\nu q$ and $\ell^+\nu\bar{q}$ sub-samples, respectively. These histograms are normalized such that their integrals equal the cross sections after event selection for the $t\bar{t}$ candidates. In the upper plot, for $M_{t\bar{t}} > 450$ GeV, the W+4 jets distribution clearly shows the positive asymmetry, with two peaks around $|\Delta y| \sim 1.5$, while the real $t\bar{t}$ distribution has one peak around $|\Delta y| \sim 0$. The $\ell^+\nu q$ and $\ell^+\nu\bar{q}$ sub-sample distributions have their peaks at $\Delta y \sim 1.5$ and $-1.5$, respectively, while the $\ell^+\nu g$ sub-sample shows a two-peak distribution with a positive asymmetry. The positive asymmetry in the total W+4 jets distribution, as well as in the $\ell^+\nu g$ and $\ell^+\nu q$ sub-samples, is also recognizable in the lower plot, for $M_{t\bar{t}} < 450$ GeV.

Figure 2: $\Delta y$ distributions in units of fb. The W+4 jets background is the sum of the W+$b\bar{b}$+2j (2) and W+4j (3) samples. The solid green, dash-dotted blue and dotted orange histograms show the contributions of the sub-samples $\ell^+\nu g$, $\ell^+\nu q$ and $\ell^+\nu\bar{q}$ in the W+4 jets sample, where the leptonic top quark is reconstructed from an $\ell^+$-$\nu$-gluon, $\ell^+$-$\nu$-quark and $\ell^+$-$\nu$-antiquark system, respectively. Histograms are normalized to the cross sections after event selection.

The $W^+$ emitted from a hard up quark in the proton often combines with the down quark from the $u \to dW^+$ transition to form a fake "top" quark, which appears as the $\ell^+$-$\nu$-quark combination in Fig. 2 and tends to have its rapidity in the forward direction, especially at high $M_{t\bar{t}}$. The forward-emitted $W^+$ sometimes combines with a relatively 'soft' gluon to form an $\ell^+\nu g$ top candidate. The asymmetries of the W+4 jets sample at high and low $M_{t\bar{t}}$ in Fig. 2 are 11% and 6%, respectively, large enough to account for the observed asymmetry [1, 2] when combined with the QCD NLO prediction [3, 4, 5, 6] for the $t\bar{t}$ signal events. However, such a significant asymmetry in the W+4 jets background events has not been reported by the CDF [1] and D0 [2] collaborations.
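How a background asymmetry combines with the signal asymmetry follows from the definition in eq. (10): since $A_{FB}$ is a ratio of event counts, the asymmetry of a mixed sample is the cross-section-weighted mean of the asymmetries of its components. The sketch below only illustrates this arithmetic. The function name is hypothetical, and the numerical inputs are placeholders chosen for illustration, not results of this analysis.

```python
def combined_asymmetry(components):
    """Asymmetry of a mixture of samples.

    `components` is a sequence of (cross_section, asymmetry) pairs, with
    all cross sections in the same units and after the same selection."""
    total = sum(sigma for sigma, _ in components)
    if total <= 0:
        raise ValueError("total cross section must be positive")
    return sum(sigma * a for sigma, a in components) / total


# Hypothetical inputs: a signal with a small asymmetry and a background
# with half the signal cross section and a larger asymmetry.
example = combined_asymmetry([(1.0, 0.05), (0.5, 0.11)])
print(f"combined asymmetry: {example:.3f}")
```

In practice the weights should be the cross sections in the same $M_{t\bar{t}}$ bin and after the same selection as the asymmetries being combined.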

We note here that the $\chi^2$ function of eq. (8) has been used to fix the combinatoric ambiguities only, and all the events that pass the selection cuts of eqs. (4) and (7) are used to obtain the above distributions. In real experiments, the reconstructed $M_{\ell\nu}$, $M_{jj}$, $M_{b\ell\nu}$ and $M_{bjj}$ which appear in eq. (8) are distributed according to the detector resolution and jet-finding algorithms, and can also depend on $M_{t\bar{t}}$, since the average energy of the lepton and jets should be large for high $M_{t\bar{t}}$. We therefore examine the invariant mass distribution of the reconstructed top candidates in Fig. 3, with the same symbols as in Fig. 2. Even though none of the samples contains a real top quark, all the distributions have a peak around 150-200 GeV, at both $M_{t\bar{t}} > 450$ GeV and $M_{t\bar{t}} < 450$ GeV. Although in real experiments the parton-level invariant mass distributions in Fig. 3 should be smeared by hadronization and detector resolution as well as by the jet-finding process, the probability that events with lower (<150 GeV) or higher (>200 GeV) invariant mass at the matrix-element level survive as top quark candidates in experiments would be lower than that for events in the 'top' mass region, 150 GeV $< M_{\ell\nu j} <$ 200 GeV. We therefore re-examine all the distributions after removing events where the reconstructed mass ($M_{\ell\nu j}$) is below 150 GeV or above 200 GeV. None of the distributions shown in this report is affected significantly. For example, the forward-backward asymmetries for $M_{t\bar{t}} > 450$ GeV and $M_{t\bar{t}} < 450$ GeV after the $M_{\ell\nu j}$ cut are 13% and 5%, respectively, as compared to the 11% and 6% reported above from Fig. 2.

Figure 3: Invariant mass distributions of the reconstructed leptonic top in units of fb/GeV. The solid green, dash-dotted blue and dotted orange histograms show the contributions of the sub-samples $\ell^+\nu g$, $\ell^+\nu q$ and $\ell^+\nu\bar{q}$ in the W+4 jets sample, where the leptonic top quark is reconstructed from an $\ell^+$-$\nu$-gluon, $\ell^+$-$\nu$-quark and $\ell^+$-$\nu$-antiquark system, respectively. Histograms are normalized to the cross sections after event selection.

We further note that $|V_{us}|^2 \sim 5\%$ of the forward $\ell^+\nu q$ candidates have the $s$-quark, and $|V_{ub}|^2 \sim 0.002\%$ the $b$-quark. Although these events may be statistically insignificant, the jet-flavor dependence of the $b$-(mis)tagging probability may be worth studying in the ($\ell\nu j$)-top-candidate events. All in all, we would like the Tevatron experiments to look for the positive asymmetry in the W+jets background events as a prediction of the SM, at least at the matrix-element level. We believe that such serious studies of the background events will shed light on the $t\bar{t}$ asymmetry issue.

In order to aid further discrimination of the W+4 jets background events from the $t\bar{t}$ signal, we examine two other kinematic distributions which are expected to differ significantly between the signal and the background. One is the transverse momentum distribution of the reconstructed top quark, as shown in Fig. 4 with the same symbols as in Fig. 2, i.e., solid red lines for the $t\bar{t}$ signal and dashed blue lines for the W+4 jets background. The distribution of the W+4 jets events is softer than that of the $t\bar{t}$ signal events; this is because the real top quark is produced via the $s$-channel $q\bar{q}$ annihilation, whereas the top quark candidate in the W+4 jets sample is formed with jets which are dominantly emitted via the $t$-channel gluon exchange. The difference between the $t\bar{t}$ and W+4 jets distributions is more evident in the high $M_{t\bar{t}}$ region, since the mean top quark $p_T$ from $q\bar{q}$ annihilation grows linearly with $M_{t\bar{t}}$, while that from the W+jets background grows only logarithmically. The $p_T$ of the reconstructed top quark may be a good discriminant between the signal and background, especially at high $M_{t\bar{t}}$.

Figure 4: The transverse momentum distributions of the reconstructed top-quark candidates in units of fb/GeV. The solid red histograms show the distributions of the $t\bar{t}$ signal events, and the dashed blue histograms give those of the W+4 jets background events for $M_{t\bar{t}} > 450$ GeV (upper) and $< 450$ GeV (lower). Histograms are normalized to the cross sections after event selection.

We also compare the transverse momentum distribution of the reconstructed $t\bar{t}$ system, $p_T(t\bar{t})$. If the W+4 jets background is significant in the high $M_{t\bar{t}}$ region, the reconstructed $t\bar{t}$ system should have significantly softer initial-state radiation than what is expected for the standard $q\bar{q}$ annihilation processes for $t\bar{t}$ production. This is because the hard scale of the W+4 jets process is set by the smallest $p_T$ of the selected jets, $\sim 20$ GeV, as compared to that of the $t\bar{t}$ production process, $\sim M_{t\bar{t}}/2$. We try to simulate the difference using MadGraph [11] and Pythia [16]. For the $t\bar{t}$ signal, we use the standard MLM matching scheme [17] with $t\bar{t}$+1-jet matrix elements for stable $t$ and $\bar{t}$ quarks. We allow Pythia to generate only initial-state radiation using the $p_T$-ordered shower [16]. The dashed, solid and dotted red histograms in Fig. 5 are obtained for the factorization scale of $Q = M_{t\bar{t}}/n$ with $n = 1$, 2 and 4, respectively, when the parton shower is matched to the $t\bar{t}$+1-jet matrix elements at $p_T(t\bar{t}) = M_{t\bar{t}}/6$. For the W+4 jets background, the dotted blue histogram is obtained with the scale of $Q = 10$ GeV, half of the jet selection $p_T$ cut. The solid blue histogram is obtained when the initial-state shower is matched to the W+5 jets matrix elements at a 5th-jet $p_T$ of 10 GeV. Since we suppress the final-state parton shower, our simulation is far from realistic; we believe, however, that the difference in the initial-state-radiation effects between $t\bar{t}$ production and W+4 jets events should be as large as indicated in Fig. 5, if one could separate final-state jets from initial-state ones. We therefore suggest that the softer-than-predicted $p_T(t\bar{t})$ distribution reported by D0 [2] can be a hint of a significant contamination of the W+4 jets background in their candidate events. Serious studies of the initial-state-radiation effects on the W+4 jets background are hence desired, in addition to the NLO contributions of the $t\bar{t}$+jet process as recently discussed in Ref. [18].

Figure 5: The transverse momentum distributions of the reconstructed $t\bar{t}$ system in arbitrary units. $Q$ is the factorization scale for the initial-state parton shower [16]. All histograms are normalized such that their integrals equal one.



4. Discussions and Conclusion 

In this report we have studied properties of the W+4 jets background for the $t\bar{t}$ production events at the Tevatron by applying the $t\bar{t}$ event-selection cuts to the background events at the matrix-element level. We find that those W+4 jets background events which pass the selection cuts have the following peculiar properties:

• The rapidity difference ($\Delta y = y_t - y_{\bar{t}}$) distribution is broader than that of the $t\bar{t}$ events and has a large positive asymmetry at high $M_{t\bar{t}}$.

• The transverse momentum distribution of the top quark candidates is significantly softer than that of the real top quarks, especially at large reconstructed $M_{t\bar{t}}$.

• The transverse momentum distribution of the reconstructed $t\bar{t}$ system is softer in the W+4 jets sample than in the real $t\bar{t}$ sample.

We suggest that the discrepancy in the $p_T(t\bar{t})$ distribution found by the D0 collaboration [2] may be explained by a significant contamination of the W+4 jets background.

Since our analysis is based on the over-simplified identification of final-state partons as jets, the distributions presented in this report should not be regarded as realistic. Nevertheless, we would like to suggest that experimentalists re-examine their signal-to-background discriminants by using the matrix-element-level predictions for the $\Delta y$, $p_T(\mathrm{top})$ and $p_T(t\bar{t})$ distributions as hints of possible background shapes, noting that we do not yet have full control of the matched parton-shower predictions for multi-jet events at hadron colliders.